Efficient Skyline Computation on Massive Incomplete Data
نویسندگان
چکیده
Abstract Incomplete skyline query is an important operation to filter out pareto-optimal tuples on incomplete data. It harder than due intransitivity and cyclic dominance. analyzed that the existing algorithms cannot process massive data efficiently. This paper proposes a novel table-scan-based TSI algorithm deal with high efficiency. solves issues of dominance by two separate stages. In stage 1, computes candidates sequential scan table. The dominated others are discarded directly in 1. 2, refines another scan. pruning devised this reduce execution cost TSI. By assistant structures, can skip majority phase 1 without retrieving it actually. extensive experimental results, which conducted synthetic real-life sets, show compute
منابع مشابه
Skyline View: Efficient Distributed Subspace Skyline Computation
Skyline queries have gained much attention as alternative query semantics with pros (e.g.low query formulation overhead) and cons (e.g.large control over result size). To overcome the cons, subspace skyline queries have been recently studied, where users iteratively specify relevant feature subspaces on search space. However, existing works mainly focuss on centralized databases. This paper aim...
متن کاملEfficient Progressive Skyline Computation
In this paper, we focus on the retrieval of a set of interesting answers called the skyline from a database. Given a set of points, the skyline comprises the points that are not dominated by other points. A point dominates another point if it is as good or better in all dimensions and better in at least one dimension. We present two novel algorithms, Bitmap and Index, to compute the skyline of ...
متن کاملSkyline Computation on Commercial Data
• Our data set contains data on 55208 cars [1]. • To each car, 23 attributes are assigned. – correlated (e.g., cylinders and engine size). – anti-correlated (e.g., mileage and registration date). – nearly independent (e.g., mileage and horsepower). • Outliers countervail correlation effects. • Cardinalities differ greatly, e.g.: – 5988 different values for attribute price. – only 17 different v...
متن کاملEfficient Skyline Computation in MapReduce
Skyline queries are useful for finding interesting tuples from a large data set according to multiple criteria. The sizes of data sets are constantly increasing and the architecture of back-ends are switching from single-node environments to non-conventional paradigms like MapReduce. Despite the usefulness of skyline queries, existing works on skyline computation in MapReduce do not take full a...
متن کاملEfficient computation of combinatorial skyline queries
Current skyline evaluation techniques are mainly to find the outstanding tuples from a large dataset. In this paper, we generalize the concept of skyline query and introduce a novel type of query, the combinatorial skyline query, which is to find the outstanding combinations from all combinations of the given tuples. The past skyline query is a special abundant when used in decision making, mar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Science and Engineering
سال: 2022
ISSN: ['2364-1541', '2364-1185']
DOI: https://doi.org/10.1007/s41019-022-00183-7